UK Patent Application „ GB m 2 263 002«A 

(43) Date of A publication 07.07.1993 



(21) Application No 9227180.3 


(51) INT CL 6 


ssnec 7/cn 


(22) Date of filing 31.12.1992 


(52) UK CL (Edition L) 


(30) Priority data 


G4A AAC 


(31) 820304 (32) 06.01.1992 (33) US 


(56) Documents cited 




QB 2226165 A US 3993691 A 


(71) Applicant 




Inlet Corporation 


(58) Field of search 


(Incorporated In the USA - Delaware) 


UK CL (Edition L) G4A AAC 


INT CL 5 G06F 7/50 


2200 Mission College Boulevard, Santa Clara, 


Online database: WPI 




California 95052, United States of America 




(72) Inventor 




JackTPoon 




(74) Agent and/or Address for Service 




Potts, Kerr & Co 




15 Hamilton Square, Birkenhead, Merseyslde, 




L41 6BR, United Kingdom 





(54) Parallel binary adder 

(57) An N-bit binary adder with a highly parallel structure comprises a multiplicity of parallel modulo-2 adders for forming the 
sum of corresponding operand and carry bits. The carry inputs are generated by a conditional carry propagation generator 
and an unconditional carry generator from which carry bits are generated In log 2 2N operational levels. 




2/20 




FIG. 2 

(PRIOR ART) 




FIG. 3 

(PRIOR ART) 



4/20 




«j» 4l— .,3. 



81 




5/20 




FIG. 7 



6/20 




FIG. 8 



7/20 




ir 13 



FIG. 9 



n ,60 




9/20 




10/20 



A N-I- 

B N-r 



A|- 

Ao" 



C N-I 



SUM UNIT 

m 



CARRY 
GENERATOR 

m 



"Sq 



A N-I B N-I 



A| B, A 0 B 0 

FIG. 13 





PROPAGATION 




GENERATOR 






A N-I B N-I 


- 1 1 1 1 

A| B| A 0 B 0 



1MIW 



2| 



V y 1 



5 



2| 



o 



l/in 



I 

CM 



4h 



I 

4 



S 

in 



JL 

CM 

i 



£ J 



12/20 




13/20 



T 



T 



o — evi 10 1 in vq oq. o>. % = Q ^ 
tjts (so Itf o o o 'o o o o lo o a u 



_L 



_L 



(9 
flT" 



nD 



To" 

i« 

I 
I 
I 

1r 



i - *3 

„4>^ ... 



I 
l 
I 



1 

S 1 



-1=1 



.§3^.3..i g .— -j 



! 
I 
t 
I 



r 2~ 



i 

i 
i 
I 

~T 
I 
I 
I 
I 
i 

.J. 
I 



3 



, f „„. 



l==f 

— s3 



4V i=£j - - 



I 
I 

I _ 
-J. 



- IS" 

- I=S 
!- 1=5! 



— N=2 
e i 

oi r 



- *3 

— fc=<Hl 

I _ZJ 



1 o - ~ ! < ia «l i-j os, a 7Z m I a e 5 ffi 

OO o 'O O O O jo O O O jo O O O 



L 



J. 



_L 



5 esj S J3" 
e> 19 o o 



O O O C9 



e3 esJ <\X cJ" 
e> o cd o 



^ JO irt 
cvJ csl csT Pd 
ID O 19 O 



3 

w 



£>1 

s 



5 1=5 



8 g5 

1=5 
(=5 

j|§ | 

8 



3 



T r _ 



=Tg- 



§ J?3 

feS 



Si 



O 19 <9 t9 



5 

C9 



3 

C9 



O 



16/20 




17/20 




63 62 6| Go C_| 

FIG. 20 




A3B3 A 2 6 2 A|B| A 0 6 0 

FIG. 21 




G 7 G 6 G 5 G/, G 3 G 2 G| G 0 C.| 

FIG. 22 



19/20 



C7 

a CARRY-OUT 




P7P6P5P4 



FIG. 23 



2263002 



fi pAflft! LEL BINARY ADDER 

F1FI.D OF INVENTION i 

The Invention pertains to the field of arithmetic adder circuits and 

more specifically to binary adder networks. 

5 RAHKCiRQlJND TO THE INVENTION 

Binary adder networks are basic to digital computer arithmetic 
operation. Because of the large number of adder operations Involved, the 
history of computer development shows a constant search for faster adder 
networks either through faster component technology or by improved 
1 0 network organization using various auxiliary logic or computing networks to 
augment the basic adder unit 

Early digital computers used rippte-cany adders in which the ii& adder 
output bit may be represented by the modulo-2 bit sum 

s r A . eB < ec «-i 

1 5 where Aj and Bj are the i& bit of the input operands, and Cm is the carry-in 
from the next lowest bit sum. The carry-in may be represented in terms of 
the prior stage operands (Am, Bm) and the prior stage carry-In, Q.2. as 
Cm« Am • Bn + Cj-2 (Am + Bm) where (-, +) are Boolean (AND, OR) 
operators respectively. The time for the carry-bits to ripple through became 

20 the limiting factor in the speed of adders. Subsequent fixed-time adders 
were introduced to overcome these deficiencies. These fixed-time adders 
may be classified into two categories: conditional sum and carry-look-ahead 
(CLA) adders. 

Conditional adders compute each bit sum, S|, twice: one sum, Sni. 

25 based on the assumption that the carry-in bit, C|, is zero; a second sum, Sb, 
on the assumption that d - 1 . Figure 1 is the logic diagram of a 4-bit-sllce 
conditional sum adder. (Ref. "Introduction to Arithmetic M , Waser and Flynn, 
Holt, Rinehart and Winston, 1 982, p. 77ff). The two input operands are 
represented by input bits Ao, Ai, A2, A3 and Bo, Bi, B2, B3, respectively. 

30 Each pair of operand bits (Aj, Bj) are applied to input terminals 110. Ao, Bo 
correspond to the input operand least significant bit while A3, B3 correspond 
to the most significant bits. The conditional sum adder consisls of two basic 

1 



sections: tho conditional sum generator unit 130 that forms at its output the 

two sets of conditional sums apd conditional carry, Sno. Sni, Sn2.Sn3, Cm 

and Seo. Sei, Se2i Se3, Ce4. the latter group being based on the 

assumption of a non-zero carry-In to its corresponding individual conditional 

5 sum generator 141, 143, 145, 147, 149, respectively. These conditional 

signals are applied to conditional sum selector unit 150 consisting of the 

individual output selectors 161, 163, 165, 167, 169 corresponding to output 

sum bits So, Si, S3 and output carry bit C4. The selection Is controlled 

by the carry-In bit, Co, and its complement, Co, operating on the 

1 0 conditional sums by means of AND-gales 1 1 3 and OR-gates 1 1 5. 

The logic equations governing ihe behavior of the conditional 4-bit 

slice adder of Figure 1 are as follows: 

S= A A ©B A 
NO 0 0 

s »s 

E0 NO 

s r A i eB i $G o 

S E1- A i eB i eP 0 
S^=A 2 ©B 2 0(G 1+ T 1 G o ) 

s E2 =A 2 eB 2 e ( G 1 + T l P o) 

S N3= A 3 eB 3 e ( G 2 +T 2 G 1 +T 2 T 1 G o) 
S E3= A 3 eB 3 e ( G 2 +T 2 G 1 +T 2 T 1 P 0) 
C N4 =G 3 +T 3 G 2 + T 3 T 2 G 1 +T 3 T 2 T 1 G 0 



C E4 =G 3 + T 3 G 2 +T 3 T 2 G 1 + V2 T 1 P 0 



where 



p r A i B r 
15 V A i eB r 

The true 4-bit sum and carry-out Is selected by selector unit 150 in 
accordance in accordance with the following boolean equations: 

50 « SeoCo + Sno Co 

51 a SeiCo + Sni Co 
20 S2 = SE2C0 + Sn2 Cp 

S3 = SE3C0 + S N 3 Co 

C4 = CE4C0 + CN4 
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The above concept could be extended to additional bits with the 
attendant increase in complexity implied by the above equations and by 
Figure 1. 

Carry-looks ahead (CLA) adders have been the most popular 
5 integrated circuit implementation in the recent past because of their 

simplicity and modularity. Modularity implies relative ease in extending the 
number of bits in each operand by the use of identical parallel units. 

Consider, for example, the 4-bit slice CLA of Figure 2. Comparison 
with Figure 1, a 4-bit slice conditional adder, clearly shows the relative 
10 simplicity of the CLA. 

The CLA sum may be expressed In the following logic expression as 

s=a eB ec 

°l I i 1-1 , 1=0, 1,2,3 

and the CLA carry as 

C| + AiBi + Ci (A| + B|) 

15 or Q = Gi + P|C| 

where Gj=*AjBi 
and Pi = Aj + B\ 

The above CLA sum expression can be immediately evaluated, 
absent the carry term (Cm), by forming the EOR of the two operands (A|, Bj). 

20 The carry term, Cm. is a function of lower order indexed operands, 

(Am, Bm), and lower order carries, C}-2. As a result, the time to complete an 
addition is generally governed by availability of the carry-in bit to each sum- 
bit. 

The above expression for Ci is a recursive equation, i.e., one in which 
25 the current value, Ci+i, Is a function of its own past values. It may be 
explicitly stated as follows: 

C|+i = Gi + P|Gm + PjPmGi-2 + - + PjPi-i .» P<A) 
Hence, for the four-bit case of Figure 2, the major output carry, C4, may be 
expressed as 

30 C4 = G 3 + P3G2 + P3P2G1 + P3P2P1G0 + P3P2P1 P0C0 

By substituting the following into the above expression 
Go' = G3 + P3G2 + P3P2G1 + P3P2P1G0 
3 



and Po^PaPgPiPoCo obtains C4«Go' + Po'Co 

which represents the logical expression for the Go\ Po' output terminals of 

Figure 2. 

If two networks of the type shown in Figure 2 were to be used as a 
5 modular units for generating an 8-bit sum, the carry-in bit to the higher order 
4-bit network, C4, would have to be formed in accordance with the above 
expression. The output carry of the higher order unit, C8, would then be 
expressible as 

C 8 » GV + Pt'Go' + Pt'Po'Co 
1 0 where Q' and p \ are the CLA output pair of the next higher order CLA 
modular unit. 

Modularity was extended by means of a four group CLA generator 
that accommodated four CLA 4-bit slice adders and produced at output the 
necessary carry information, i.e., C 4 , C 8 , C12 and P\ G n t to form a 16-bit CLA 
1 5 adder using four modular adder units of the type shown in Figure 2. Figure 3 
shows a four group CLA generator with four input pairs. 
( G 0' P n) '( G ^;) '(Vol and ( G o - p ^ 

v o o/ \ 1 v v 2 2/ \ 3 3/ and carry outputs corresponding 
to C4, Cs, C12 and {P\ G"), where 



20 and 



G i 2 =G ' 2 +p, 2 G 'i +p 2 p ; G o +p, 2 p ; p o c o 



G" = Q' + P' G' + P' P' P'C 

3 3 2 3 2 10 
p.. = p . p. p. p . 

3 2 10 



Thus, the most significant carry-out bit, C16, could be logically formed as 
C 16 = G" + P" C 0 

and passed on, as needed, to higher order modular CLA adder units. 

25 Figure 4 shows the logical extension of modular CLA concept to 

64-blt addition. A total of sixteen modular 4-bit slice SLA adders 200 are 
arrayed In parallel to accept Input operand pairs. (A 0 . B 0 ) ... (A3. B 3 ), (A 4 , B 4 ) 
... (A7. B 7 ), .... (A 60 . B 6 o) ... (A 63 , B 63 ) and carry-in bits, (Co. Cie. C 32 . C 4 b), 
each producing 4-bit output sums. (S 0 . Si, S 2 , S3) ... (S 6 o. S 6 i, S 62l S 63 ) . 

30 and carry-generate/carry-propagale pairs ( P ' 0 > G ' 0 ) ••••( P i5 ,G i 5 ) 

4 



A second logical level of four modular four group CLA generators 
250, each accepting the carry putput information of a corresponding group of 
four CLA adders 200, generates the necessary carry information for its 
associated adders 200 from the four pairs of carry-generate/carry-propagate 
5 pairs and the necessary carry-generate/carry-propagate pairs, 



consisting of a single CLA generator 250 generates the three additional 
carry-in bits, (Ci6> C32, C48) supplied to the first and second levels. In this 
manner, modular 4-bit slice CLA adders have been used to accommodate 

1 0 higher precision operation. 

Also, the basic conditional adder unit of Figure 1 may be used as a 
modular adder and extended to higher precision addition by using the CLA 
generator concept because the logic equations defining the higher order 
carries are similar. For example, it may be shown (op cit Waser and Fiynn) 

1 5 that the second level conditional same carries may be expressed as 



Because the logic required to implement the above expressions is identical 
20 to that of the CLA generator 250 of Figure 3 and 4, a 1 6-bit adder may be 
implemented as shown in Figure 5. The adder has four conditional adders 
100 connected in parallel, each accepting 4-bit pairs of operands. Each 
adder 100 consists of a conditional sum generator 130 and a multiplexor 
150. The modular group cany-out pairs, [(Cn4» Ce4). (Cn8i Ces), (Cni2. 
25 CE12)]. a r © supplied to CLA generator 250 which produces the modular 
carry-In bits (C4, C8, C12) required to form the sixteen bit addition. The 
extension required to accommodate more bits clearly indicated by the CLA 
method previously discussed. 



30 desirable that the number of processing steps required to generate the 
carry-bits (and hence the sum) be proportional to the logarithm of the 
number of bits in each operand, and at a relatively low-cost. Also, a logic 
structure that allows constant fan-in and fan-out and permits static versus 




I. from which the third logic level 



C4 « Cm + Ce4 Co 

Ce « Cns + Ceb Cn4 + Ces Ce4 Co 

C12 ■ CN12 + CE12 CN8 + CE12 Ceb Cn8 + Ce12 Ce8 Ce4 Cq 



Because of the need for cost effective parallel fast adders, it is highly 
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fixed rale pre-charge/discharge operation is desirable. The present 
invention is designed to achieye these goals. 
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SUMMARY OF THF IMVFMTIOM 

A parallel N-bit binary qdder network is described comprising a 
multiplicity of parallel modulo-2 adders, each accepting and summing 
corresponding operand bit pairs and a final sum carry input. The final sum 
5 carry bits are generated in parallel by a carry generating network that 

comprises a parallel carry propagation logic array for generating conditional 
carry propagation terms based on the logical OR-Ing of pairs of input 
operand bits, an unconditional carry generation logic network based on 
AND-ing of pairs of input operand bits, and a logic array for operating on 
1 0 conditional and unconditional carry terms, in parallel for producing a set of 
final sum carry terms that are fed in parallel to the modulo-2 parallel adders. 
The number of gate delays for generating the final set of N sum carry Inputs 
to the modulo-2 adder is (log22N], providing a substantial increase in adder 
throughput. 
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BRIEF DESCRIPTION OF THF DRAWIN^ 

Figure 1 1s a logic diagram of a prior art 4-bit slice conditional sum 

adder. 

Figure 2 Is a logic diagram of a prior art 4-bit slice carry-look-ahead 
(CLA) adder. 

Figure 3 is a logic diagram of a prior art four group CLA generator. 

Figure 4 is a block diagram of a prior art 64-bit adder using full CLA. 

Figure 5 Is a block diagram of a prior art 16-bit conditional sum adder 
using a four group CLA generator. 

Figure 6 is a flow diagram of a four-bit carry process. 

Figure 7 is a flow diagram of an eight-bit carry process. 

Figure 8 is a flow diagram of a sixteen-bit carry process. 

Figure 9 is a logic diagram for a typical carry generator node 
implementation. 

Figure 10 is a block diagram of an 8-bit carry generator. 

Figure 11 is a logic diagram for an 8-bit propagation generator. 

Figure 12 is a logic diagram for a one-bit adder with carry input. 

Figure 13 shows a block diagram of a complete parallel adder. 

Figure 14 shows the structure of an m-bit two-level carry generator 
module. 

Figure 15 shows the interconnections for a 16-bit carry-generator 
using 4-bit two-level modules. 

Figure 16 shows the partitioning of a 16-bit first and second level 
carry-generate matrix for use with 4-bit two-level modules. 

Figure 17 shows the partitioning of a 16-bit third and fourth level 
carry-generate matrix for use with 4-bit two-level modules. 

Figure 18 shows the interconnections for a 64-bit carry-generator 
using 8-bit two-level modules. 
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PFTA11 FP DESCRIPTION OF THE INVENTION 

The sum, S, of two N-tyt binary number operands (A,B) where 

A»An.i.An-2, .» .Ao 
B = Bn-i.Bn.2. — Bo 
5 may be expressed as 

S = Si,Sn-2. So 
where W B i 0( Vi 

represents the value of the sum bit expressed as the modu!o-2 sum of the 

Hh operand bit values (Ai, Bi) and the carry-in bit Ci-1 . from the modulo-2 

1 0 sum of the next least significant bit pair (An , Bn). Thus, by using the 

boolean logic operators (•) for "AND" and (+) for "OR", the carry bits may be 

expressed as 

Co = Ao * Bo 
Ci » Ai • Bi h 
15 C2 = A2*B2 + (A2- 



Ci » Ai • Bi + (Ai + Bi) • Co 
"i+B 2 )*Ci 



C| = AcB| + (Ai + Bj)*Cm 

I 

Cn-1 » An-i • Bn-1 + (An-1 + Bn-1) * Cn-2 

20 For convenience, let 

Gi = ArBi 
P| 5 =A I + B| 

so that the above carry bit expression become 

0 0 

C =G +PC 

1 110 

2 2 2 1 
25 I 

C M.,-Vl + P M+t°H.t 
(Note that for further convenience, the explicit "AND" operator symbol has 
30 been omitted so that P| CisPi • Cj). This convention will be used throughout 
the following description. 

The above recursive expressions may be expanded as follows: 

Co = G 0 
Ci = Gi +P1G0 
35 C2 = G2 + ?2 G1 + P2 Pi Go 

C3 = G3 + P3 G2 + p 3 p 2 Q1 + p 3 p 2 Pi Go 

C4 = G4 + P4 G3 + P4 P3 G2 + P4 P3 P2 Gl + P4 P3 P2 Pi Go 



C| = G| + Pi Gm + Pi Pm Gi-2 + P| Pm Pi* Gi- 3 
+ ... + P|Pn Pj.2... Pi Go 

1 I 

This set of equations may, In turn, be expressed in matrix form as 



N-1. 



p p 

2 1 

3 2 1 



1 

P, 



P P 

2 1 



P P 

L N-1 N-2 

or simply 



P- P 



N-2 N-3 



N-3 N-2 



1 

P- 



P 1 1 J 



N-1J 



C=P< N >fl 

where a is the carry column vector, 

g is the carry generator column vector, 
and P(N) is the lower triangular NXN cany propagation matrix. 

Thus, fl = [G 0 Gi G 2 ... G N -i] T * (A 0 B 0 A1B1 A 2 B 2 ... A N -iBn-i] t 
represents the "AND'Mng of operand bit pairs which generate a carry-out 
when high. Matrix P, whose elements represent propagation control 
variables, describes the means by which the carry-outs are propagated to 
and through higher order bits. 

Significantly, the P-matrix may be factorized into the product of sparse 
lower triangular matrices. For example, 
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Thus, at each binary increment, 2 k £ r < 2 k+ \ pM Is factorable into 
(k+1) lower triangle matrices of the form shown. These faciorized equations 

5 may be represented by the flow diagrams of Figure 6. 7 and 8. 

Figure 6 corresponds to the four-bit carry propagation process 
represented by the factorization of P( 4 >. The input to the process consists of 
the carry-generator vector, [Go Gi G2 G3) T shown at the bottom. The 
diagonal lines with arrow-heads correspond to multiple ("AND 11 ) operations 

10 on the data of the node of origin by the corresponding labelled expression. 
Unlabeled vertical lines between nodes represent transmission paths with 
no modification of data transmitted from a lower node to a higher node. All 
nodes are summing ("OR") junctions. For example, C1 = Gi + P1G0 and C3 

= P3P2 (Gi + P1G0) 
15 + (G 3 + P3G2) = P3P2P1G0 + P3P2G1 + P3G2 + G3. 
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The carry-out vector, [Co Ci C2 C 3 lT, is represented by the values present at 
the upper output nodes. j 

Figure 7 and 8 show flow diagrams for P(&> and PM6) t respectively 
representing 8 and 16 bit carry generation processes. Clearly, flow 
5 diagrams for greater number of bits may be generated in a similar fashion by 
extending the principles expounded. 

For each binary increment, 2 k £ r £ 2 K+ 1 - 1 t 0 r for each doubling of 
the number of bits used in the operands, one additional sparse lower 
triangular matrix is required to represent the factorized form of the PW matrix. 
1 0 Thus, for 2 £r £3, PW factors into 2 matrices; for 4 Sr £7, PW factors into 3 
matrices, and for2 k *r£2i<+M, P(') factors into (k+1) matrices. 

Each factorized matrix operation corresponds to a row of nodes 
shown in Figures 6, 7, and 8. The lowest (zero) level nodes correspond to 
the input carry generate vector values, g. The values at the next level of 
1 5 nodes corresponds to the column vector that would obtain if the extreme 
right hand factorized matrix of the examples given above were to operate on 
the input generate vector, g. Similarly, the second level of nodes has values 
corresponding to that which would obtain if the second most extreme right 
had factorized matrices operated on the vector resulting from product to its 
20 right. And so on for succeeding levels. 

In general, k + 1 factorized matrices (stages) are required for2 k + 1 bits 
in each operand, i.e., [log2N] stages for N-bit operands. 

The flow diagrams of Figures 6, 7 and 8 also imply the logic network 
structures shown in Figures 9 through 11. 
25 Figure 9 represents a typical nodal processor 10 located at, say, node 

I, k of Figure 8 producing Gi^ at its output. Processor 10 accepts as input 

operands GM,k-2M,Q Mik and P k P k-t '" P k _ 2 l-1 at Hs input terminals 

I I , 12 and 13 respectively. B AND M -gate 16 and "OFT-gate operate on these 
inputs to produce at output 14 the boolean function 

30 k-2 1-1. k-2 
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Figure 10 is an embodiment of an 8-bit carry generator having four 
rows (0-3) and 8 coiumns {0-7}. Rows 1 through 3 comprises 7, 6 and 4 
nodal processors 10, respectively, each of the type depicted In Figure 9. 
Row 0 comprises 8 AND-gates 20 arranged to accept at input terminals 301 
5 corresponding operand bit pairs, {Ar, Br), forming Go.k = Ar • B k and supplied 
to processors 10 on One 1 1. The processors 10 of row 1 also accept the 
seven propagation variable P, through P7 on input lines 305. Propagation 
variable Pk being applied as an input to processor 10 located at row 1, 
column k on line 13 together with Go, k-1 supplied by lines 12. The output of 
1 0 processor 1 0 located at 1 ,k Is 

Gl,k=Go.k + PkGo.k-l 
In a similar manner, processors 10 of row 2 are supplied the outputs 
of row 1 together with propagation variable P21 through P76 from input line 
307. The output of processor 10 located at 2, k is 
1 5 G2,k = Gi, k + PkPk-1 Gi, k -2 

Processor 10 at location 3, k in a similar manner generates an output 

G3JC = G2, k + PkPk-lPk-2Pk-3 G2,k-4 
from inputs provided by lower level processors and propagation variable 
P4P3P2P1 through P7P6P5P4 supplied on input lines 309. 

20 Carry output Co is available directly from AND-gate 20 at location 0,0 

on line 303; C1 from output line 14 of processor 10 at location 1,1; C2 and 
C3 from processors 10 at location 2,2, and 2,3 respectively; and C4 through 
C7 from row 3 processor 10 outputs. 

It is clear, by reference to the flow diagrams of Figures 6, 7 and 8 and 

25 carry generators 300 of Figure 10, that the architecture and organization of 
the 8-blt carry generator 300 may be expanded Indefinitely adding an 
additional row each time the number of bits in each operand is doubled. 
The number of parallel processors required in each row is summarized in 
Table I. 
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Table I 



Figure 11 is a iogic circuit for implementing an 8-bit propagation 
generator suitable for supplying propagation variables to the 6-bit carry 
generator of Figure 10. Propagation generator 400 comprises 7 OR-gates 
40 in row 0 used to form propagation variables Pi, P 2 , .... P 7 from input 
operand bit pairs {Ak, Br} as follows: 
Pk = Ak+Bk 

The set, {Prf, is available on output lines 307. Subsequent rows are 
comprised of AND-gates 50. The kJb AND-gate of row 1 accepts the Mb and 
k-1ib output of row 0 to form at its output 307 P k P k -L Similarly, the kth 
processor of row 2 accepts the kill and k-2& output of row 1 to form the set of 
propagation variables, {P k P k -1 Pk-2 Pk-a}. provided at output 309. 

Clearly, the organization and architecture of processor 400 may be 
extended to accommodate more operand bits by extending the structure of 
Figure 11 to the left and adding an additional row of AND-gates 50 each 
time the number of input operand bits are doubled. The number of gates 
required per row are indicated in Table II. 
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Table II 



Figure 12 represents a logic network 60 for forming the complete bit 

sum of two operand bits (Ak, Bk) and a carry-In bit Ck comprising exclusive- 

or (EOR) networks 61 and 62. EOR network 61 forms the modulo-2 sum 

5 A R 0 B k and network 62 produces at its output 

S = A e B ec 
°k k D k k-1 

Based on the preceding descriptions summer network 60, carry 
generator 300 and propagation generator 400, a complete parallel binary 
adder may be defined as shown in Figure 1 3, organized to accept two N-bit 
10 operands 

A « Ao Ai A2 ... An-i 
B = Bo B1 B2 ... Bn-1 
Operands A and B are applied to the inputs of propagation generator 400, 
carry generator 300 and sum unit 500. Propagation generator 400 and carry 

15 generator 300 are configured in accordance with the prior description. Sum 
unit 500 comprises N one-bit plus carry-in bit EOR networks 60, each as 
described In Figure 12. The carry-in to each EOR network 60 is provided by 
the appropriate output terminal of carry generator 300. Propagation 
variables are provided to carry generator 300 by propagation generator 400 

20 as determined by the two input operands A and B. The output of sum unit 
500 is 

S = So Si ... Sn«i 

where 
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-s k =A k eB k ec 



k-1 



10 



15 



20 



25 



Note that carry Cn-1 is avallab|e at the output as an overflow bit of for use in 
extending the number of bits In the operands A and B. 



medium scale Integrated circuit technology. For example, by properly 
sub-sectioning the flow graph of Figure 8 Into seven subsections as shown 
by the dotted outlines, a 4-bit wide and 2-level deep module may be defined 
that forms the basis for a modular building-block approach to the circuit 
implementation. The 4-bit wide partitioning is somewhat arbitrary and is 
mainly chosen for purposes of explanation because it probably represents 
the lowest level of modularization that allows the principle of modularity to 
be described 

Figure 14 Is a block diagram of an m-blt wide, 2-level module 500 
comprising two layers of m nodal processors 10 of the type shown In Figure 
9. Five sets of m-input lines are accommodated: Inputs 501 accept the 
corresponding l-level outputs, {G| ik }; inputs 503 accept the l-level outputs 
displaced by 2^, {Gi fk - 2 M }; inputs 505 and 507 accept the conditional 



accept the (1+1 )ib inner layer output terms (displaced by 2 1 ), {G| +1fk . 2 i}. 

Two sets of output lines are provided: outputs 51 1 correspond to the 
first layer output terms, {Gi + i.k}; and outputs 513 are the second layer (or 
module) outputs, {G|+2,k}. 

Figure 15 is an Interconnection diagram for a carry generator 300 
using 4-bit wide (m«4) 2-layer modules 500. Each logic unit 520 represents 
a set of four unit 20 AND-gates used to form (Gr). 

Figure 15 may be best understood by referring to Figure 16 that 
shows the matrix equation relating the zero level (l=0) Inputs, {G 0 *}, to the 
second level (l«2) outputs, {G z tf, and to Figure 17 showing the matrix 
equation relating the second level outputs to the fourth level outputs, {Ga,u). 



In Figure 16, the two 16x16 matrices ( p 1 . ? 2 J are each partitioned into 
16 4x4 submatrices. Each non-zero valued submatrix corresponds to a 



The preferred implementation of carry generation 300 uses modular 
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single layer 4-bit wide operation performed within a 500 module. The 
submatrices of the right-hand piatrix correspond to first layer operations 
while those in the left-hand matrix correspond to the second layer operations 
previously described. Similarly, the right hand set of submatrices in Figure 
5 1 7 corresponds to third level (l=3) operations and the left set corresponds to 
fourth level (1=4) operations. These equations provide interconnect 
information by relating the individual module 500 inputs to their outputs. 

For example, consider the input/output relationship of module 500 In 
the first row of Figure 15 identified by coordinates (1 ,3). 

10 G2 i 8-11=P2.32'Pl t 2lGo i O-3+P2,32'Pl,22'G0 r 4.7+P2,33Pl.32Go.4-7+P2.33G0,8-11 

Because P2,32 4 Pi,21=0 

G2,8.1l=P2.32*Pl ( 22Go.4-7+P2 > 33Pl i 32G0.4-7+P2 i 33Go,8-11 
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This latter equation expresses the required inputs to module 500 
(1 ,3): the first expression on the right implies only two non-zero products 



andG 



i.7;as inputs; 



7^8 ° 1 ' 6 and s^g G t 7 thus requiring ' ^ • G 1t 6 

the second expression requires 8 ^ ' Jj 0 ' : and the third requires the input 

quadruplet [G 0t8 Go,9 G 0| io G 0 ,ii] T , and the triplet ■ J. 
Summarizing, the required inputs are: Go.8-11. Gi,6-7, Qo.7. and 
9 - 1 1^9 _ 1 1 ■ as shown ln R 9 ure 1 5 - (Please note that for Figure 1 5, the 
output carries, {CjO, are equal to {Qa % ^.) 

A similar analysis for module 500 (2,4) results in the following 
expression: 



2,12 
a 2,13 
3 2,14 
3 2.15 



The interconnections shown in Figure 15 for module 500 (2,4) result. 
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Figure 18 shows a simplified Interconnection diagram for a 64-bit 
carry generator using 3-layersjof 8-bit wide two layer modules. Specific 
details of the Interconnections may be obtained by partitioning the 
carry-generator 300 matrices in the same manner as shown for the 4-bit 
5 wide two layer example. For the 64-bit case, however, three sets of 
equations, corresponding to the three layers of Figure 18, must be used. 

Another preferred embodiment using a slightly different concept of 
modularity is shown in Figure 19. For purposes of explanation, a 24-bit 
adder network is shown comprising: three 8-bit conditional adder networks 

10 141 each accepting two eight bit operands [(Aq-7, B0.7), (A8-15, Be-is), 

(Aie-23, B16-23)]. and each outputting two conditional 8-bit sums (Se.Sn) as 
previously described in Figure 1; multiplexer units 160 for selecting the Se 
or Sn output of each conditional adder unit which is controlled by a two state 
carry signal; carry and propagation generator units 600 each comprising a 

1 5 carry generator 400 for accepting two 8-bit operands and producing at its 

output the highest carry, say, out of a possible set of (Co, C1 C7) for 

controlling its associated 2:1 MUX 160. Note that the lowest order (extreme 
left) MUX 160 is shown dotted so as to indicate that modularity consideration 
may require that each 8-bit conditional adder 141 be packaged with an 

20 associated MUX 160, in which case its control but would be set low because 
the absence of an input carry makes the Sn output always valid. In effect, 
each of the three vertical grouping of units 141, 160 and 600 constitute a 
modular adder and carry-out generator 700 requiring its associated two 
fields of operand bits and carry-in bit. The tandem ensemble of these units 

25 makes-up the complete adder. The output sum is represented by the 25-bit 
sum S0-7. $8-15, S16-23. S24. 

In order to accommodate the carry-in bits (C-1, C7, C1 5) to units 600, 
a slight modification of the basic matrix and flow diagram must be made. 
Consider, the unit 600 shown on the extreme right of Figure 19. The 

30 requisite matrix has the form 
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Note that if the carry-in, C.i, fs zero (non-existent), the first row and column 

are zero. Also, Po=C-i so that PO and all its product terms vanish if C-i=0. 

Thus, when C-i=0, networks 300 and 400 are as previously defined. If Ci=1 
5 when the form of the matrix, carry-generator network 300 and propagation 

generator 400 have the same logic structure as previously described. 

For example, Figure 20 shows the flow diagram corresponding to a 

4-bit carry-generator 300 with an input carry bit d, suitable for 

concatenating 4-blt conditional adder units in a similar fashion to that shown 
1 0 for 8-bit conditional adder units 141 in Figure 19. The necessary steps 

required for generating the output carry, C3, are shown by solid lines while 

the dolted-lines represent the other possible, but not required, processing 

steps previously shown. This implies the carry-out generator structure 300' 

shown in Figure 21 using processing modules 10. 
1 5 Figure 22 and Figure 23 are the corresponding flow diagram and 

simplified carry-out generator structure 300 for an 8-bit unit respectively, as 

used in the adder network of Figure 19. 

Figure 24 shows a simplified block diagram for propagation generator 

400' suitable for use with the 8-bit adder module 700 of Figure 19. The 
20 subset of propagation terms required the 4-bit carry-out generator 300' 

( p 3 p 2 p i p o. p 3 p 2. p 3. p i) is also available from this unit. 
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The same flow diagram and logic networks are applicable to all 
concatenated units 600 of Figqre 19. However, in the case of the lowest 
significant unit 600 at the extreme right of Figure 19, the carry-in from the 
previous stage is non-existent so that C-i*0. For the other stages, the 
5 carry-out of the previous section is used as the carry-In. 

Clearly, the concept of modular carry propagation for extended 
operand precision, an example of which is shown in Figure 19 Is adaptable 
to the use of 4, 8, 1 6, ... or any other size modular bit units by implementing 
units 600, 160 and 141 for the word size desired. Also, mixed systems in 
1 0 which associated units 600, 1 60, and 1 41 of a given 700 section, are of the 
same word size, but not necessarily the same word size the other 700 units 
tandemly connected with it, can be constructed. 

These and other similar variations will become apparent to those 
versed in the art. 
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CLAIMS 



1. A parallel carry generating network for use In a muttibit binary 
5 adder comprising: 

(a) means for accepting a first and second N-blt binary operands; 

(b) carry propagation logic network array for generating conditional 
carry propagation terms by parallel logical operations on pairs of 
corresponding bits of said first and second operands; and 

1 0 (c) carry generator logic array for generating final sum carry bits, 

comprising: 

i) logic network for generating unconditional carry terms by 
parallel operation on said pairs of corresponding bits of said first and second 
operands; and 

1 5 ii) logic network .array for accepting and operating on said 

unconditional carry terms and said conditional carry terms for producing a 
parallel set of final sum carry terms. 



2. A parallel carry generating network as in Claim 1 wherein said 

20 carry propagation logic further comprises: 

(a) a zero level (1=0) multiplicity of N-1 gates for the logical OR-ing of 
said corresponding bit pairs for all said bit pairs except the least significant, 
producing at each gate output, its associated first level conditional carry term 
arranged in ascending order of said operand bit pair inputs; 

25 (b) a first level (1=1) multiplicity of N-2 AND-gates, each said gate 

accepting as Inputs adjacent overlapping ascending order outputs 
producing an ordered ascending set of second level conditional carry terms; 
and 

(c) additional higher levels of AND-gates, each succeeding level 
30 having N-2 ! AND-gates, each said gates having a first input connected its 
corresponding lower level ordered output and second input connected to the 
2<ltt lesser ordered output of said lower level, for forming the required higher 
level up to and including level. I=L, for which 2' * N/2; each level producing 
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succeeding higher order conditional carry terms at the individual AND-gate 
outputs. i 

3. A parallel carry generating network as in Claim 1 wherein said 
5 carry generator logic array further comprises: 

(a) a zero level (l*0) multiplicity of N gates in parallel for the logical 
AND-ing of said corresponding operand bit pairs producing at each gate 
output the unconditional carry term associated with said operand input bit 
pair, said outputs being arranged in ascending order corresponding to the 

1 0 ascending order of said operand input bit pairs; 

(b) carry generator processor having a first second and third input, 
said first and second inputs connected to distinct partial carry terms of the 
same next lower level and said third input connected to an associated 
conditional carry term provided by said conditional carry generator for 

1 5 producing at its output a next level partial carry term; 

(c) a first level (1=1) multiplicity of N-1 processors arranged in 
ascending order each having a first, second and third input, each said first 
input connected to one corresponding zero level processor outputs for all 
outputs except that associated with the least significant of said input bit pairs, 

20 said second input of each processor connected to the output of the zero 
level gale associated with the next least significant of said input bit pairs, 
each said third processor Input connected to the corresponding zero level 
output of said carry propagation logic, producing at a each processor output 
a partial carry term of second order arranged in ascending order; and 

25 (d) additional higher levels (I = 2, 3 L) of processors each level 

having N-2 M processors, each said first input connected to one said 
corresponding next lower level processor output for all outputs In ascending 
order beginning at the 2 M output, said second input of each processor 
connected to the output of the next lower level processor at 2 M positions 

30 below said corresponding next lower level processor input to said first input, 
and said third input connected to the corresponding lib level conditional 
carry generator output. 

- 25 



4. A parallel carry generator network as in Claim 3 wherein said carry 
processor comprises: j 

(a) OR-gate with an output and a first and second input ; said first 
input connected to said processor first input, said OR-gate output being said 

5 processor output; and 

(b) AND-gate with Its output connected to said OR-gate second input, » 
with a first and second input, said first input connected to said processor 

second input and said second AND-gate input connected to said processor 
third input 

10 

5. A paraliei adder network comprising: 

(a) parailel sum unit having first, second and third set of inputs, said 
first and second input set for accepting a first and second input operand, 
said third input set connected to the output of a paraliei carry generating 

15 network, for accepting parallel .carry bits and producing the sum of the input 
operands. 

(b) parailel carry generating network having a first and second 
operand and generating at its output terminals a parallel set of carry bits. 

20 6. A parallel adder network as in Claim 5 wherein said parallel sum 

unit further comprises a multiplicity of sum units, one for each operand bit 
pair comprising adder means for forming the modulo-2 sum of each 
corresponding operand bit pair and cany bit. 

25 7. A parallel adder network as in Claim 6 wherein said adder means 

comprises exclusive-or gates. 

8. A carry generating module comprising: 
(a) an equal number of first and second level carry generator 
30 processors each having a first, second and third input, said first and second , 
input connected to distinct partial carry terms from the corresponding next 
lower level output, said second input connected to associated displaced 
partial carry terms from the next lower layer output and said third input 
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connected to an associated conditional carry term from the next lower layer 
output, for producing at each said carry generator processor output a next 
level partial carry term; 

(b) means for connecting said output of each said first level carry 
5 generator processor to a corresponding said second level processor first 

input; 

(c) means for connecting said first and second level processor 
outputs to a first set of external terminals; 

(d) means for connecting said first and second level processor 
1 0 second and third inputs to a second set of external terminals; and 

(e) means for supporting said network as a unitized structure. 

9. A multibit adder network comprising: 

(a) modular adder and carry-out generator unit capable of accepting 
1 5 two fields of operand bits and an input carry bit for producing the sum and 
output carry, said modular adder and carry-out generator comprising a 
conditional sum adder, conditional sum selector means controlled by the 
input carry, and a parallel carry generator unit for generating said output 
carry from said operand bits and input carry bits; 
20 (b) multiplicity of said modular adder and carry-out generator units 

connected in tandem so that the prior modular adder and carry-out 
generator carry-out is connected to the control input of said conditional sum 
based on a carry-in is selected when said prior carry-out is asserted, 
otherwise selecting the other conditional sum, said selected sums and final 
25 carry-out being representative of the desired adder output. 

10. A modular multibit adder network as in claim 9 wherein each said 
modular adder and carry-out generator units accepts two input operand 
pairs comprising a like number of bits. 

30 

1 1. A modular multibit adder network as in claim 9 wherein each said 
modular adder and carry-out generator units may each accept input operand 
pairs of differing numbers of bits. 

27 



12. A modular carry propagation unit for use in a modular multlblt 
adder network comprising: 

(a) a simplified carry generator for generating an output carry bit from 
the two sets of operand bits and a carry-in bit comprising a minimum set of 
processor elements necessary to generate said output carry bit, controlled 
by an associated minimum set of propagation variables; and 

(b) a simplified propagation generator for generating propagation 
control variables from the two sets of operand bits and a carry-in bit 
comprising a minimum set of logic elements necessary to generate said 
minimum set of propagation variables needed to control said simplified carry 
generator. 
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